Protein L and Hybrid Proteins Thereof

The present invention relates to sequences of protein L
which bind to light chains of immunoglobulins. The
invention also relates to hybrid proteins of protein L
having the ability to bind to light chains of all Ig and
also to bind to light and heavy chains of immunoglobulin
G, DNA-sequences which code for the proteins, vectors
that contain such DNA-sequences, host cells transformed
by the vectors, methods for preparing the proteins,
reagent apparatus for separating and identifying
immunoglobulins, compositions and pharmaceutical
compositions which contain the proteins.

The invention relates in particular to the DNA-sequence
and to the amino acid sequence of the light-chain
binding domains of protein L.

Proteins which bind with high affinity to the constant
domains of the immunoglobulins (Ig) are known. Thus,
protein A (from Staphylococcus aureus) (Forsgren, A. and
Sjöquist, J. (1966) Protein A from Staphylococcus
aureus. I. Pseudo-immune reaction with human
gamma-globulin. J. Immunol. 97: 822-827) binds to IgG
from various mammalian species. The binding of protein A
to IgG is mediated essentially via surfaces in the
Fc-fragment of the heavy chain of the IgG-molecule,
although a certain degree of binding also occurs with
surfaces in the Fab-fragment of the IgG. Protein A is
unable to bind to human IgG3, nor will it bind to IgG
from several other animal species, such as important
laboratory animals, for instance rats and goats, which
limits the use of protein A.

Protein G (Björck, L. and Kronvall, G. (1984)
Purification and some properties of streptococcal
protein G, a novel IgG-binding reagent. J. Immunol. 133:
969-974; Reis, K., Ayoub, E. and Boyle, M. (1984)
Streptococcal Fc receptors. I. Isolation and partial
characterization of the receptor from a group C
streptococcus. J. Immunol. 132: 3091-3097) binds to
heavy chains in human IgG and to all four of its
subclasses and also to IgG from most mammals, including
rats and goats.

Protein H (Åkesson, P., Cooney, J., Kishimoto, F. and
Björck, L. (1990) Protein H - a novel IgG binding
bacterial protein. Molec. Immun. 27: 523-531) binds to
the Fc-fragment in IgG from human beings, monkeys and
rabbits. However, the bond is weaker than in the case of
protein G and A, which may be beneficial when wishing to
break the bond with a weak agent, for instance when
purifying proteins which are readily denatured with the
aid of antibodies.

Protein M (Applicant's Patent Application
PCT/SE91/00447) binds to the Fc-fragment in IgG from
humans, monkeys, rabbits, goats, mice and pigs.

Protein L (Björck, L. (1988) Protein L, a novel
bacterial cell wall protein with affinity to Ig L
chains. J. Immunol. 140: 1194-1197), which binds to the
light chains in immunoglobulins from all of the classes
G, A, M, D and E, is known (USP 4,876,194). The amino
acid sequence and the binding domains of this protein,
however, have hitherto been unknown.

The aforesaid proteins can be used in the analysis, 
purification and preparation of antibodies and for 
diagnostic and biological research. 

The elimination of immunoglobulins, with the aid of
plasmapheresis, can have a favourable effect on some
autoimmune diseases. A broadly binding protein would be
an advantage when wishing to eliminate all classes of
antibodies in this context.

It has long been known that infectious conditions can be 
prevented or cured with the introduction of an immune 
serum, i.e. a serum which is rich in antibodies against 
the organism concerned or its potentially harmful pro- 
duct. Examples hereof are epidemic jaundice, tetanus, 
diphtheria, rabies and generalized shingles. Antibodies 
against a toxic product may also be effective in the
case of conditions of non-infectious origin. Serum
produced in animals against different snake venoms is 
the most common application in this respect. However, 
the administration of sera or antibody preparations is 
not totally without risk. Serious immunological reac- 
tions can occur in some cases. Singular cases of the 
transmission of contagious diseases, such as HIV and 
hepatitis through the agency of these products have also 
been described. In order to avoid these secondary
effects, it has been desirable to produce therapeutic
antibodies in test tubes. A large number of novel
techniques for the preparation of antibodies in test
tubes have been proposed in recent years. Examples of
such techniques are hybridoma techniques, synthesis of
chimeric antibodies and the preparation of antibodies in
bacteria. These techniques also enable antibodies to be
specially designed which can further widen the use of 
such molecules as therapeutics, for instance in the case 
of certain tumour-diseases. In the case of some of these 
novel methods, however, the product totally lacks the 
Fc-fragment to which all of the described IgG-binding 
proteins, with the exception of protein L, bind. There 
is consequently a need for a process for purifying
antibodies for therapeutic use, wherein proteins which
have a broad binding activity/specificity can be of value.






It has long been possible to utilize the antibody
reaction, with its high degree of specificity, for
diagnosing past or, in some cases, ongoing infections
with different parasites. This indirect method of
indicating infectious agents is called serology and, in
many cases, may be the only diagnostic alternative. In
certain cases, it can also be of interest to demonstrate
specific IgE- or IgA-antibodies. When diagnosing with
the aid of serology, the antigen is most often attached
to a solid phase, whereafter serum taken from the
patient is incubated with the antigen. Antibodies from
the patient that have been bound can then be detected in
different ways, often with the aid of a secondary
antibody (for instance, an antibody which is directed
against the light chains of human antibodies) to which
an identifiable label has been attached, such as
alkaline phosphatase, biotin, radioactive isotopes,
fluorescein, etc. In this context, a protein having a
broad Ig binding capacity can be used as an alternative
to secondary antibodies.

There are also a number of non-therapeutic and
non-diagnostic reasons for binding antibodies.
Antibodies are often used in research, both for
detecting and for purifying the antigen against which
they are directed. All techniques which facilitate the
purification of antibodies and, in particular,
techniques which enable different classes to be
purified, are of interest in this context.
Consequently, there is a serious need for a protein
which has a broad binding activity/specificity and which
binds to several different classes of immunoglobulins
from different animal species. At present, there is no
known protein which will bind to all immunoglobulin
classes. The earlier known proteins A, G, H and M bind
only to heavy chains in IgG. The known protein L (Björck
et al., 1988) binds to the light κ-chains and λ-chains
in immunoglobulins of all classes, although the bonds
are much weaker on the λ-chains. Applicant has charted
protein L, has determined the amino acid sequence for
protein L, has identified the light-chain binding
domains on protein L, and has used these to produce
hybrid proteins which also possess the IgG-Fc-binding
domains of protein G. The Applicant is able to show
through protein LG that a protein of broader binding
activity/specificity can be produced thereby. The
aforesaid proteins A, G, H and M bind to the same
surfaces, or to very closely lying surfaces, on IgG-Fc.
The protein L which binds to light chains can thus be
combined with any other functionally similar protein
which binds to the Fc-fragment of heavy chains. A
similar broadening of the Ig-binding activity is
achieved with all alternatives.

Thus, the present invention relates to the sequence of
protein L which binds to light chains in Ig and has the
amino acid sequence disclosed in Figure 1, and variants,
subfragments, multiples or mixtures of the domains B1-B5
having the same binding properties. The invention also
relates to a DNA-sequence which codes for such protein
sequences, for instance the DNA-sequence in Figure 1.

The invention is concerned with a hybrid protein which
is characterized by comprising domains which bind to the
light κ-chains and λ-chains in immunoglobulins of all
classes, and also comprises domains which bind to heavy
chains in immunoglobulin G, wherein those domains which
bind to the light chains are chosen from among the B1-,
B2-, B3-, B4- and B5-domains in protein L (see Claim 1)
and those domains which bind to heavy chains of
immunoglobulins are chosen from the C1-, C2- and
C3-domains in protein G; the A-, B- and C1-domains from
protein H; the A-, B1-, B2- and S-domains in protein M1
or the A-, B- and C-domains in protein A (see Figure 6)
and variants, subfragments, multiples or mixtures of
these domains that have the same binding properties and
bind to heavy chains of immunoglobulins.

By subfragment is meant a part-fragment of the given 
domains or fragments which include parts from the vari- 
ous domains having mutually the same binding properties. 
By variants is meant proteins or peptides in which the 
original amino acid sequence has been modified or 
changed by insertion, addition, substitution, inversion
or deletion of one or more amino acids, while
retaining or improving the binding properties. The
invention also relates to those proteins which contain 
several arrays (multiples) of the binding domains or 
mixtures of the binding domains with retained binding 
properties. The invention also relates to mixtures of 
the various domains of amino acid sequences having 
mutually the same binding properties. 

The invention relates in particular to a hybrid protein
designated LG, characterized in that the hybrid protein
includes the B-domains in protein L which bind to the
light chains in immunoglobulins, and the C1-domains and
C2-domains in protein G which bind to heavy chains, and
has the amino acid sequence disclosed in Figure 3. The
invention also relates to variants, subfragments,
multiples or mixtures of these domains.



Protein LG is a hybrid protein having a molecular weight
of about 50 kDa (432 amino acids) and comprising four
domains, each of which binds to light chains in
immunoglobulins, and two IgG-binding domains from
protein G. The hybrid protein combines a broad
IgG-binding activity, deriving from the high-grade
binding ability of protein G to the Fc-fragment of the
heavy chain on IgG, with the ability of protein L to
bind to light chains of all classes of immunoglobulins.
Thus, protein LG binds polyclonal human IgG, IgM, IgA,
IgD and IgE. The affinity for human polyclonal IgG is
2 x 10^10 M^-1. All four human immunoglobulin classes
are bound. Binding to human IgG is effected with both
the κ- and the λ-chain. Both the Fc-fragment and the
Fab-fragment of IgG are bound to the hybrid protein. The
protein also binds human IgA-, IgD-, IgE- and
IgM-antibodies. The bond is stronger to human
immunoglobulins which carry κ than to those which carry
the λ-isotype of light chains. IgG from most mammals
will be bound by protein LG, thus also IgG from goats
and cows, which do not bind to protein L. Rabbit IgG,
which binds relatively weakly to protein L, will bind
well to the fusion protein. IgM and IgA-antibodies from
mice, rats and rabbits will be bound to the protein.

Protein LG is highly soluble. It is able to withstand
heat and will retain its binding properties even at high
temperatures. The binding properties also remain in a
broad pH-range of 3-10. The protein withstands detergent
and binds marked or labelled proteins subsequent to
separation in SDS-PAGE and transference to membranes
with electroblotting. The protein can be immobilized on
a solid phase (nitrocellulose, Immobilon®,
polyacrylamide, plastic, metal and paper) without losing
its binding capacity. The binding properties are not
influenced by marking with radioactive substances,
biotin or alkaline phosphatase. (The binding abilities
of protein LG are disclosed in Example 3.)
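
As a reading aid for the affinity quoted above, an association constant Ka (in M^-1) can be converted into a dissociation constant Kd = 1/Ka. The sketch below assumes simple 1:1 equilibrium binding, which the text does not state, and the function name is purely illustrative.

```python
def dissociation_constant(ka_per_molar: float) -> float:
    """Return Kd in mol/l for an association constant Ka given in l/mol.

    Assumes simple 1:1 equilibrium binding (an assumption made for
    illustration; the description only quotes the Ka value).
    """
    if ka_per_molar <= 0:
        raise ValueError("Ka must be positive")
    return 1.0 / ka_per_molar


# Ka quoted for protein LG and human polyclonal IgG: 2 x 10^10 M^-1
kd_lg = dissociation_constant(2e10)
print(f"Kd = {kd_lg:.1e} M, i.e. about {kd_lg * 1e12:.0f} pM")
```

Under that assumption the quoted Ka corresponds to a Kd in the picomolar range.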
The protein comprises 432 amino acids and has a
molecular weight of 50 kDa deriving therefrom. The
sequence begins with an alanine, which is unrelated to
the two proteins, followed by the three last amino acids
in the A-domain of protein L (val-glu-asn), whereafter
the four mutually highly homologous B-domains from
protein L follow. The first of the B-domains is
comprised of 76 amino acids, and the remaining domains
are each comprised of 72 amino acids. The first nine
amino acids from the fifth B-domain are included and
followed by two non-related amino acids (pro-met). The
protein G-sequences then follow. The last amino acid in
the so-called S-domain from protein G is followed by an
IgG-binding domain from protein G (C1; 55 amino acids),
the intermediate D-region (15 amino acids) and the
second IgG-binding C-domain (C2; 55 amino acids). The
last amino acid is a methionine, which occurs in natural
protein G as the first amino acid in the so-called
W-region.

The invention also relates to DNA-sequences which code 
for the aforesaid proteins. 

The gene which codes for the IgG-binding amino acid
sequences can be isolated from the chromosomal DNA from
Staphylococcus aureus based on the information on the
DNA-sequence for protein A (S. Löfdahl, B. Guss, M.
Uhlén, L. Philipson and M. Lindberg. 1983. Gene for
staphylococcal protein A. Proc. Natl. Acad. Sci. USA.
80: 697-701) and Figure 6, or from G-streptococcus,
preferably strain G 148, or C-streptococcus, preferably
strain Streptococcus equisimilis C 40, based on the
information on protein G (B. Guss, M. Eliasson,
A. Olsson, M. Uhlén, A.-K. Frej, H. Jörnvall, I. Flock
and M. Lindberg. 1986. Structure of the IgG-binding
regions of streptococcal protein G. EMBO J. 5:
1567-1575) and Figure 6, or from group A-streptococcus,
e.g. S. pyogenes (type M1), based on the information on
the DNA-sequence for protein H (H. Gomi, T. Hozumi,
S. Hattori, C. Tagawa, F. Kishimoto and L. Björck. 1990.
The gene sequence and some properties of protein H - a
novel IgG binding protein. J. Immunol. 144: 4046-4052)
and Figure 6, or from the chromosomal DNA in group
A-streptococcus type M1 based on the information on the
DNA-sequence for protein M (Applicant's Patent
Application PCT/SE91/00447) and Figures 6 and 7. The
gene which codes for the protein that binds to light
chains can be isolated from the chromosomal DNA from
Peptococcus magnus 312 based on the information on the
DNA-sequence for protein L in claim 2.

By using the chromosomal DNA obtained from the aforesaid
bacteria as a template, a DNA-fragment defined with the
aid of two synthetic oligonucleotides can then be
specifically amplified with the aid of PCR (Polymerase
Chain Reaction). This method also enables recognition
sites for restriction enzymes to be incorporated in the
ends of the amplified fragments (PCR Technology.
Principles and Applications for DNA Amplification. Ed.
Henry Erlich. Stockton Press, New York, 1989). The
choice of recognition sequences can be adapted in
accordance with the vector chosen to express the
fragment, or the DNA-fragment or other DNA-fragments
with which the amplified fragment is intended to be
combined. The amplified fragment is then cleaved with
the restriction enzyme or enzymes concerned and is
combined with the other fragment or fragments concerned,
and the fragments are then cloned together in the chosen
vector (in this case, the expression vector) (Sambrook,
J., E. Fritsch and T. Maniatis, 1989, Molecular cloning:
A laboratory manual, 2nd Ed. Cold Spring Harbor
Laboratories, Cold Spring Harbor, New York, USA). The
plasmid vector pHD313 can be used (Dalboge, H., E. Bech
Jensen, H. Tottrup, A. Grubb, M. Abrahamson, I. Olafsson
and S. Carlsen, 1989. High-level expression of active
human cystatin C in Escherichia coli. Gene, 79:
325-332), alternatively one of the vectors in the
so-called pET-series (pET 20, 21, 22, 23) retailed by
Novagen (Madison, Wisconsin, USA).
The hybrid proteins are then incorporated in an
appropriate host, preferably E. coli. The invention also
relates to such hosts as those in which the hybrid
proteins are incorporated.

Those clones which produce the desired proteins can be
selected from the resultant transformants with the aid
of a known method (Fahnestock et al., J. Bacteriol. 167,
870 (1986)).

When the proteins that can bind to the light chains in 
the immunoglobulins and to the heavy chains in IgG have 
been purified from the resultant positive clones with 
the aid of conventional methods, the binding specifici- 
ties of the proteins are determined for selection of 
those clones which produce a protein that will bind to 
the light chains in immunoglobulins and to the heavy 
chains in IgG. 

Subsequent to having isolated plasmid DNA in said
clone with conventional methods, the DNA-sequence in the
inserted material is determined with known methods
(Sanger et al., Proc. Natl. Acad. Sci. USA 74, 5463
(1977)).
The invention also relates to DNA-sequences which
hybridize with said identified DNA-sequences under
conventional conditions and which code for a protein
that possesses the desired binding properties. Strict
hybridizing conditions are preferred.

Expression of the genes can be effected with expression
vectors which have the requisite expression control
regions, the structural gene being introduced after said
regions. As illustrated in Figure 1 and Claim 2, the
structural gene can be used for protein LG or other
hybrid proteins with protein L.

With regard to expression vectors, different
host-vector-systems have been developed, of which the
most suitable can be selected for expression of the
genes according to the present invention.

The present invention also relates to a method of
producing the inventive hybrid proteins by cultivating a
host cell which is transformed with an expression vector
in which DNA which codes for the proteins according to
the invention is inserted.

This method includes the steps of

(1) inserting into a vector a DNA-fragment which codes
for the hybrid proteins;

(2) transforming the resultant vector into an
appropriate host cell;

(3) cultivating the resultant, transformed cell for
preparation of the desired hybrid protein; and

(4) extracting the protein from the culture.
In the first step, the DNA-fragment which codes for the
hybrid protein is inserted in a vector which is suitable
for the host that is to be used to express the hybrid
protein. The gene can be inserted by cleaving the vector
with an appropriate restriction enzyme, and then
ligating the gene with the vector.

In the second step, the hybrid vector is inserted into
host cells. The host cells may be Escherichia coli,
Bacillus subtilis or Saccharomyces cerevisiae or other
suitable cells. Transformation of the hybrid expression
vector into the host cell can be effected in a
conventional manner and clones which have been
transformed can then be selected.

In the third step, the obtained transformants are
cultivated in an appropriate medium for preparation of
the desired proteins by expression of the gene coding
for the hybrid protein.

In the fourth step, the desired protein is extracted
from the culture and then purified. This can be achieved
with the aid of known methods. For instance, the cells
can be lysed by treating them with ultrasound or
enzymes, or by mechanical degradation. The protein which
is released from the cells or which is secreted into the
medium can be recovered and purified with the aid of
conventional methods often applied within the
biochemical field, such as ion-exchange chromatography,
gel filtration, affinity chromatography with the use of
immunoglobulins as ligands, hydrophobic chromatography
or reverse-phase chromatography. These methods can be
applied individually or in suitable combinations.
As mentioned above, the inventive proteins may be used
for binding, identifying or purifying immunoglobulins.
They can also be bound to pharmaceuticals and used in
formulations which have delayed release properties. To
this end, the protein may be present in a reagent
appliance or pharmaceutical composition in combination
with appropriate reagents, additives or carriers.

The proteins can be handled in a freeze-dried state or
in a PBS-solution (phosphate-buffered physiological salt
solution), pH 7.2, with 0.02% NaN3. They can also be
used connected to a solid phase, such as
carbohydrate-based phases, for instance CNBr-activated
sepharose, agarose, plastic surfaces, polyacrylamide,
nylon, paper, magnetic spheres, filters and films. The
proteins may be marked with biotin, alkaline
phosphatase, radioactive isotopes, fluorescein and other
fluorescent substances, gold particles, ferritin, and
substances which enable luminescence to be measured.

Other proteins may also be used as carriers. These 
carriers may be bound to or incorporated in the pro- 
teins, in accordance with the invention. For instance, 
it is conceivable to consider the whole of proteins A, 
G, H, M as carriers for inserted sequences of protein L 
which bind to light chains. In turn, these carriers can 
be bound to the aforesaid carriers. 

The pharmaceutical additives that can be used are those
which are normally used within this field, such as
pharmaceutical qualities of mannitol, lactose, starch,
magnesium stearate, sodium saccharate, talcum,
cellulose, glucose, gelatine, saccharose, magnesium
carbonate and similar extenders, such as lactose,
dicalcium phosphate and the like; disintegrating agents,
such as starch or derivatives thereof; lubricants such
as magnesium stearate and the like; binders, such as
starch, gum arabic, polyvinylpyrrolidone, gelatine,
cellulose and derivatives thereof, and the like.

The invention will now be described in more detail with
reference to the accompanying drawings, in which

Figure 1 illustrates the plasmid pHD389; the ribosomal 
binding sequence, the sequence for the signal peptide 
from ompA and recognition sequence for several restric- 
tion enzymes are shown; 
Figure 2 illustrates the amino acid and nucleic acid
sequence for protein LG;

Figure 3 is a schematic overall view of the production 
of protein L; 

Figure 4 is a schematic overall view of the production 
of protein LG; 

Figures 5a, 5b and 5c are schematic overall views of the 
production of the hybrid proteins LA, LM and LH respec- 
tively; 

Figure 6 is a schematic inclusive illustration of
protein A, G, H and M1. IgGFc-binding domains are for
protein A: E, D, A, B and C; for protein G: C1, C2 and
C3; for protein H: A and/or B; and for protein M1: A,
B1, B2, B3 and S;

Figure 7 illustrates the amino acid and nucleic acid
sequence for protein M1;
Figure 8 illustrates Western Blot for protein G, L and
LG with certain immunoglobulins and immunoglobulin
fragments; and

Figure 9 illustrates Slot-Blot for protein L, G and LG
with IgG, Igκ and Ig Fc.

The amino acid and nucleic acid sequence of the
light-chain binding domains of protein L is illustrated
in claims 1 and 2 respectively.

It will be observed that the drawings are not to scale. 

Example 1

Cloning and expression of the IgG-light-chain-binding
domains in protein L

Construction of synthetic oligonucleotides (primers) for
amplifying sequences coding for protein L, domains B1-B4

It has been found that a protein L peptide (expressed in
E. coli) constructed of the sequence
ala-val-glu-asn-domain B1 (from protein L) binds to the
light chains of the immunoglobulins (W. Kastern,
U. Sjöbring and L. Björck. 1992. Structure of
peptostreptococcal protein L and identification of a
repeated immunoglobulin light chain-binding domain.
J. Biol. Chem. in press). Since this single protein
L-domain has a relatively low affinity to Ig
(1 x 10^7 M^-1), and since the naturally occurring
protein L, which is constructed of several mutually
similar domains (B1-B5), has a high affinity to Ig
(1 x 10^10 M^-1), four of these domains have been
expressed together in the following way:
PL-N and PL-C1 are synthetic oligonucleotides
(manufactured by the Biomolecular Unit at Lund
University (Sweden) in accordance with Applicant's
instructions) which have been used to amplify, with PCR
(Polymerase Chain Reaction), a clonable gene fragment
which codes for four Ig-binding protein L domains
(ala-val-glu-asn-B1-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-glu-glu).
Amino acids in the protein L-sequence are given for the
primer which corresponds to the coding strand (PL-N):

PL-N: 5'-GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC-3'
                        valgluasnlysglugluthrproglu

The 3'-end of this oligonucleotide is homologous with
the coding strand in the protein L-gene (emphasized):
those codons which code for the last three amino acids
in the A-domain (val-glu-asn) are followed by the codons
for the first six amino acids in the first of the
Ig-binding domains in protein L (B1).
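
The reading frame of PL-N can be checked with a short script. The following is a minimal sketch, not part of the original description: it locates the HpaII recognition sequence (CCGG) in the primer as printed above and translates the complete codons that begin at the last base of that site. The function and variable names are illustrative, and the codon table is deliberately restricted to the codons occurring in this primer.

```python
# Standard genetic code, restricted to the codons that occur in this example.
CODON_TABLE = {
    "GTA": "val", "GAA": "glu", "AAT": "asn", "AAA": "lys",
    "ACA": "thr", "CCA": "pro",
}

PL_N = "GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC"  # primer as printed above
HPAII_SITE = "CCGG"  # HpaII recognition sequence


def translate_after_site(primer: str, site: str) -> list[str]:
    """Translate the complete codons starting at the last base of `site`."""
    # The final G of CCGG is also the first base of the val codon GTA.
    start = primer.index(site) + len(site) - 1
    codons = [primer[i:i + 3] for i in range(start, len(primer) - 2, 3)]
    return [CODON_TABLE[codon] for codon in codons]


print("-".join(translate_after_site(PL_N, HPAII_SITE)))
```

The two bases left over at the 3'-end of the primer do not form a complete codon and are ignored by the sketch.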

PL-C1: 5'-CAGCAGCA^ATTCIIAITATTCTTCTGGTTTTTCG CTT-3'

This oligonucleotide is homologous with the opposing
non-coding strand in the gene for protein L (the
sequence corresponds to the first nine amino acids in
domain B5).

DNA-fragments which have been amplified with the aid of
PL-N contain the recognition sequence for the
restriction enzyme HpaII (emphasized) immediately before
the codon which codes for the first amino acid (val) in
the expressed protein L-fragment. The fragment which is
cleaved with HpaII can be ligated with DNA (in this
case, consisting of the used expression vector pHD389)
which has been cleaved with the restriction enzyme NarI.
The DNA-fragment that has been cleaved with HpaII and
ligated with vector pHD389, which has been cleaved with
NarI, will be translated in the correct reading frame.
The construction results in translation of an additional
amino acid (ala) immediately in front of the first amino
acid in protein L.
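
The origin of the additional alanine can be illustrated by joining the two cut ends in silico. This sketch rests on assumptions that are not taken from the description: the commonly documented cleavage positions of the two enzymes (NarI cuts GG^CGCC, HpaII cuts C^CGG, both leaving a 5'-CG overhang), and a vector end represented only by the two bases that the NarI site itself implies, since the upstream pHD389 sequence is not given here. All names are illustrative.

```python
PL_N = "GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC"  # primer as printed above

# Assumed cleavage positions (enzyme properties, not stated in the text):
# NarI cuts GG^CGCC and HpaII cuts C^CGG on the top strand.
NARI_LEFT_END = "GG"  # last two top-strand bases of the NarI-cut vector

# Top strand of the primer-derived fragment end after HpaII cleavage.
cut = PL_N.index("CCGG") + 1
insert_end = PL_N[cut:]

# Ligation joins the vector end to the fragment end.
junction = NARI_LEFT_END + insert_end

# GTA is the codon for val, the first protein L residue in the construct.
val_pos = junction.index("GTA")
codon_before_val = junction[val_pos - 3:val_pos]

ALA_CODONS = {"GCT", "GCC", "GCA", "GCG"}  # the four alanine codons
print(junction[:10], codon_before_val, codon_before_val in ALA_CODONS)
```

Under these assumptions the codon immediately preceding GTA is made up of the last vector base and the CG overhang of the fragment, which is consistent with the extra ala described above.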

DNA-fragments which have been amplified with the aid of
PL-C1 will contain the recognition sequence for the
restriction enzyme BamHI (overlined above the sequence)
immediately after the sequence which codes for the last
amino acid in the expressed protein L-fragment (glu).
The vector pHD389 contains a unique recognition sequence
for BamHI as part of its so-called multiple cloning
sequence which follows the NarI recognition sequence.
DNA-fragments which have been amplified with the aid of
PL-C1 will include two so-called stop-codons
(emphasized) which cause translation of the fragment
inserted in the vector to cease.

The sequence intended for amplification contains no
internal recognition sequences for the restriction
enzymes HpaII or BamHI.
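
A check of this kind can be sketched as a plain substring scan. The recognition sequences used below (HpaII: CCGG, BamHI: GGATCC) are the commonly documented ones; both are palindromic, so scanning one strand is sufficient. The demonstration sequence is a hypothetical stand-in and not the protein L gene, and the function name is illustrative.

```python
SITES = {"HpaII": "CCGG", "BamHI": "GGATCC"}


def internal_sites(seq: str, sites: dict[str, str]) -> dict[str, list[int]]:
    """Return the 0-based start positions of each recognition sequence in `seq`."""
    seq = seq.upper()
    hits: dict[str, list[int]] = {}
    for name, site in sites.items():
        positions = []
        pos = seq.find(site)
        while pos != -1:
            positions.append(pos)
            pos = seq.find(site, pos + 1)  # step by one to catch overlapping hits
        hits[name] = positions
    return hits


# Hypothetical stand-in sequence, chosen so that it contains one HpaII site.
demo = "ATGGTAGAAAATAAAGAAGAAACACCAGAAACTCCGGAAACTGATTAA"
print(internal_sites(demo, SITES))
```

An insert is suitable for the cloning strategy described here only if both lists come back empty for the region between the primer-derived sites.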

Amplifying and cloning procedures 

PCR (Polymerase Chain Reaction) was effected with a
protocol described by Saiki, R., D. Gelfand, S. Stoffel,
S. Scharf, R. Higuchi, G. Horn, K. Mullis and H. Erlich,
1988. Primer-directed enzymatic amplification of DNA
with a thermostable DNA polymerase. Science 239:
487-491. PCR was effected in a Hybaid Intelligent
Heating-block (Teddington, UK): 100 µl of a reaction
mixture contained 50 mM KCl, 10 mM Tris-HCl, pH 8.3, 1.5 mM
of DNA (Sambrook, J., E. Fritsch and T. Maniatis, 1989.
Molecular cloning: A laboratory manual. 2nd Ed. Cold
Spring Harbor Laboratories, Cold Spring Harbor, New
York, USA). The cleaving and ligating conditions
recommended by the manufacturer of DNA-ligase and
restriction enzymes have been followed in other respects.

Expression system 

The vector pHD389 (see Figure 2) is a modified variant
of the plasmid pHD313 (Dalboge, H., E. Bech Jensen, H.
Tottrup, A. Grubb, M. Abrahamson, I. Olafsson and
S. Carlsen, 1989. High-level expression of active human
cystatin C in Escherichia coli. Gene, 79: 325-332). The
vector, which is replicated in E. coli (it contains the
origin of replication from plasmid pUC19), is
constructed so that DNA-fragments which have been cloned
into the NarI cleaving site will be transcribed and
translated downstream of and in the immediate vicinity
of the signal peptide (21 amino acids) from the envelope
protein ompA from E. coli. Translation will be initiated
from the codon ATG which codes for the first amino acid
(methionine) in the signal peptide. This construction
permits the translated peptide to be transported to the
periplasmic space in E. coli. This is advantageous,
since it reduces the risk of degradation of the desired
product by enzymes occurring intracellularly in E. coli.
Moreover, it is easier to purify peptides which have
been exported to the periplasmic space. Unique
recognition sequences (multiple cloning sequences) for
several other restriction enzymes, among them EcoRI,
SalI and BamHI, are found immediately after the NarI
cleaving site. An optimized so-called Shine-Dalgarno
sequence (also called ribosomal binding site, RBS) is
found seven nucleotides upstream from the ATG-codon in
the signal sequence from ompA; this optimized sequence
binds to a complementary sequence in 16S rRNA in the
ribosomes and is responsible for the translation being
initiated in the correct place.

WO 93/22342
PCT/SE93/00375

The transcription of such DNA as that which is
co-transcribed with the signal sequence for ompA is
controlled by the P_R promoter from coliphage λ. The
vector also contains the gene for cI857 from coliphage
λ, whose product is expressed constitutively and
down-regulates transcription from P_R. This
cI857-mediated down-regulation of transcription from P_R
is heat-sensitive. The transcription regulated from this
promoter is terminated with the aid of a so-called
rho-independent transcription terminating sequence
(which forms a structure in the DNA that results in the
DNA-dependent RNA-polymerase leaving the DNA-strand),
which is placed in the vector immediately downstream of
the multiple cloning sequence. The plasmid also carries
the β-lactamase gene (from the plasmid pUC19) whose
product permits ampicillin-selection of E. coli clones
that have been transformed by the vector.






Selection of protein L-producing clones 



The transformed bacteria were cultivated on culture
plates with LB-medium which also contained ampicillin
in a concentration of 100 µg/ml. Cultivation of the
bacteria progressed overnight at 30°C, whereafter the
bacteria were transferred to an incubator where they
were cultivated for a further 4 hours at 42°C. The
plates were kept in a refrigerator overnight. On the
next day, the colonies were transferred to
nitrocellulose filters. Filters and culture plates were
marked so as to enable the transferred colonies to be
readily identified on the respective culture plates. The
culture plates were again incubated overnight at 30°C,
so that the remains of the transferred bacteria colonies
could grow again. The plates were then kept in a
refrigerator.
The bacteria in the colonies on the nitrocellulose
impressions were lysed by incubating the filter in 10%
SDS for 10 minutes. Filters containing lysed bacteria
were then rinsed with a blocking buffer which comprised
PBS (pH 7.2) with 0.25% gelatine and 0.25% Tween-20
(four baths, 250 ml each at 37°C), whereafter the filter
was incubated with radioactively marked Ig κ-chains
(marked with ¹²⁵I in accordance with the chloramine-T
method; 20 ng/ml in PBS with 0.1% gelatine). The
incubation took place at room temperature over a period
of 3 hours, whereafter non-bound radioactively marked
protein was rinsed off with PBS (pH 7.2) containing
0.5 M NaCl, 0.25% gelatine and 0.25% Tween-20 (four
baths, 250 ml each at room temperature). All filters
were exposed to X-ray film. Positive colonies were
identified on the original culture plate. Clones which
reacted with Ig κ-chains were selected and analyzed with
respect to the size of the DNA-fragment introduced in
the vector. One of these clones, pHDL, was selected for
the production of protein L. The DNA introduced into
plasmid pHD389 in this clone was sequenced. The
DNA-sequence was found to be in full agreement with the
corresponding sequences (B1-B4 and 21 bases in B5) in the
gene for protein L from Peptostreptococcus magnus,
strain 312. The size and binding properties of the
protein produced by clone pHDL were analyzed with the aid
of SDS-PAGE (see Figure 8), dot-blot experiments (see
Figure 9) and competitive binding experiments.




Production of protein L

Several colonies from a culture plate with E. coli pHDL
were used to inoculate a preculture (LB-medium with an
addition of 100 mg/l ampicillin), which was cultured at
28°C overnight. On the following morning, the preculture
was transferred to a larger volume (100 times the volume
of the preculture) of fresh LB-medium containing
ampicillin (100 mg/l) and was cultured in shake-flasks
(200 rpm) or fermentors at 28°C. The culture temperature
was raised to 40°C (induction of transcription) when the
absorbance at 620 nm reached 0.5. Cultivation then
continued for 4 hours (this applies solely to cultivation
in shake-flasks). Upon completion of the cultivation
process, the bacteria were centrifuged down. The bacteria
were then lysed with an osmotic shock method at 4°C
(Dalboge et al., 1989, supra). The lysate was adjusted to
pH 7. Remaining bacterial debris was then centrifuged
down, whereafter the supernatant was purified on
IgG-sepharose in accordance with the protocol described
earlier for protein G and protein L (U. Sjöbring,
L. Björck and W. Kastern. 1991. Streptococcal protein G:
Gene structure and protein binding properties. J. Biol.
Chem. 266: 399-405; W. Kastern, U. Sjöbring and
L. Björck. 1992. Structure of peptostreptococcal protein
L and identification of a repeated immunoglobulin light
chain-binding domain. J. Biol. Chem., in press).

The expression system gave about 20 mg/l of protein L
when cultivated in shake-flasks. The culture was
deposited at DSM, Identification Reference DSM E. coli
LE392/pHDL.
Example 2

Cloning and expression of protein LG

Construction of oligonucleotides (primers) for amplifying
sequences which code for protein LG

Protein L

It has been found that a protein L-peptide (expressed in
E. coli) constructed of the sequence ala-val-glu-asn-
domain B1 (from protein L) will bind to the light chains
of the immunoglobulins (Kastern, Sjöbring and Björck,
1992, J. Biol. Chem., in press). Since the affinity of
this single domain to Ig is relatively low
(1 × 10^(…) M⁻¹) and since the naturally occurring
protein L, which is comprised of several mutually similar
domains (B1-B5), has a higher affinity to Ig
(1 × 10¹⁰ M⁻¹), four of these domains have been
expressed together in the following way:

PL-N and PL-C2 are synthetic oligonucleotides
(manufactured at the Biomolecular Unit at Lund University
(Sweden) in accordance with Applicant's instructions)
which were used, with the aid of PCR (Polymerase Chain
Reaction), to amplify a clonable gene fragment, called
B1-4, which codes for four Ig-binding protein L domains
(ala-val-glu-asn-B1-B2-B3-B4-lys-lys-val-asp-glu-lys-pro-
glu-glu):

PL-N:  5'-GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC-3'
                          valgluasnlysglugluthrproglu

PL-C2: 5'-CAGCAGCAGCCATGGGTTCTTCTGGTTTTTCGTCAACTTTCTTA-3'
Amino acids have been shown under corresponding triplets
in the coding strand. DNA-fragments which have been
amplified with the aid of PL-N contain the recognition
sequence for the restriction enzyme HpaII immediately
upstream of the triplet which codes for the first amino
acid (val) in the expressed protein L-fragment. The
fragment that has been cleaved with HpaII can be ligated
with DNA (in this case, the expression vector pHD389)
which has been cleaved with NarI. The construction
results in translation of an extra amino acid (ala)
immediately upstream of the first amino acid in the
protein L-fragment. The DNA-fragment that has been
amplified with the aid of PL-C2 will contain the
recognition sequence for the restriction enzyme NcoI
(emphasized) immediately downstream of the sequence which
codes for the last amino acid in the expressed protein
L-fragment (glu). Amplified fragments which have been
cleaved with NcoI can be ligated to the NcoI-cleaved,
PCR-generated pro-met-asp-CDC-met fragment (see below).
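
The compatibility of the HpaII-cleaved PCR fragment with the
NarI-cleaved vector can be illustrated with a short Python sketch.
The primer strings are transcribed from the text above. The cut
positions (C^CGG for HpaII, GG^CGCC for NarI, C^CATGG for NcoI) are
standard enzyme data supplied here as assumptions, and the helper
names are hypothetical.

```python
# Primer sequences as given in the text (5' -> 3').
PL_N = "GCTCAGGCGGCGCCGGTAGAAAATAAAGAAGAAACACCAGAAAC"
PL_C2 = "CAGCAGCAGCCATGGGTTCTTCTGGTTTTTCGTCAACTTTCTTA"

# Recognition site and top-strand cut offset. These are assumed standard
# values, not stated in the text: HpaII C^CGG, NarI GG^CGCC, NcoI C^CATGG.
ENZYMES = {
    "HpaII": ("CCGG", 1),
    "NarI": ("GGCGCC", 2),
    "NcoI": ("CCATGG", 1),
}


def five_prime_overhang(site: str, cut: int) -> str:
    """Single-stranded 5' overhang left when a palindromic site is cut."""
    return site[cut:len(site) - cut]


def site_position(primer: str, enzyme: str) -> int:
    """0-based index of the enzyme's recognition site in the primer, or -1."""
    site, _ = ENZYMES[enzyme]
    return primer.find(site)


if __name__ == "__main__":
    for name, (site, cut) in ENZYMES.items():
        print(name, site, "overhang:", five_prime_overhang(site, cut))
    print("HpaII site in PL-N at index", site_position(PL_N, "HpaII"))
    print("NcoI site in PL-C2 at index", site_position(PL_C2, "NcoI"))
```

Under these assumptions HpaII and NarI leave the same two-base 5'
overhang (CG), which is consistent with the statement that the
HpaII-cleaved fragment can be ligated into NarI-cleaved pHD389.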

Protein G

It is known that a single C-domain from protein G will
bind to IgG (B. Guss, M. Eliasson, A. Olsson, M. Uhlen,
A.-K. Frej, H. Jornvall, I. Flock and M. Lindberg. 1986.
Structure of the IgG-binding regions of streptococcal
protein G. EMBO J. 5: 1567-1575). The strength with which
a single C-domain binds to IgG is relatively low
(5 × 10^(…) M⁻¹). A fragment which consists of two
C-domains with an intermediate D-region having a length
of 15 amino acids, however, has a considerably higher
affinity to IgG (1 × 10⁹ M⁻¹). CDC-N and CDC-C are
oligonucleotides which have been used as PCR-primers to
amplify a clonable DNA-fragment, designated CDC, which
codes for two IgG-binding protein G-domains (pro-met-
asp-CDC-met).

CDC-N: GGCCATGGACACTTACAAATTAATCCTTAATGGT
           metaspthrtyrlysleuileleuasngly

CDC-C: CAGGTCGACTTATTACATTTCAGTTACCGTAAAGGTCTTAGT

Amino acids in the resultant sequence have been shown
beneath the primer of the coding strand. DNA-fragments
which have been amplified with the aid of CDC-N contain
the recognition sequence for the restriction enzyme NcoI
(marked with a line above the sequence). Cleaved
amplified fragments can be ligated with the fragment that
has been amplified with the aid of PL-C2 and then cleaved
with NcoI. The fragment will therewith be translated in
the correct reading frame. DNA-fragments which have been
amplified with the aid of CDC-C will contain two
so-called stop codons (emphasized) which terminate
translation. The recognition sequence for the restriction
enzyme SalI (marked with a line above the sequence)
follows immediately afterwards, this sequence also being
found in the expression vector pHD389 (see Figure 1).

Those sequences which code for the binding properties of
protein L (B1-B5) and for protein G (CDC) respectively
contain no internal recognition sequences for the
restriction enzymes HpaII, SalI or NcoI.
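
The reading frame across the NcoI junction, and the two stop codons
introduced by CDC-C, can be checked with the following Python
sketch. It is illustrative only: the primer strings are taken from
the text, the NcoI cut position (C^CATGG) and the standard genetic
code are assumed, and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

PL_C2 = "CAGCAGCAGCCATGGGTTCTTCTGGTTTTTCGTCAACTTTCTTA"
CDC_N = "GGCCATGGACACTTACAAATTAATCCTTAATGGT"
CDC_C = "CAGGTCGACTTATTACATTTCAGTTACCGTAAAGGTCTTAGT"
NCOI = "CCATGG"  # assumed cut position: C^CATGG


def reverse_complement(seq: str) -> str:
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the start of seq; '*' marks a stop."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def ncoi_fusion(upstream: str, downstream: str) -> str:
    """Join two coding strands at their NcoI sites, as after ligation."""
    up_cut = upstream.find(NCOI) + 1
    down_cut = downstream.find(NCOI) + 1
    return upstream[:up_cut] + downstream[down_cut:]


# PL-C2 and CDC-C are reverse primers, so the coding strand is the
# reverse complement. The frame starts at index 1 (AAG = lys) of the
# reverse-complemented PL-C2.
junction = ncoi_fusion(reverse_complement(PL_C2), CDC_N)
junction_peptide = translate(junction[1:])
cdc_end_peptide = translate(reverse_complement(CDC_C))

if __name__ == "__main__":
    print(junction_peptide)
    print(cdc_end_peptide)
```

In this sketch the junction reads ...glu-glu-pro-met-asp-thr-tyr...,
so the protein G part continues in frame after the protein L part,
and the coding strand derived from CDC-C ends in met followed by two
stop codons and the SalI site.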

Amplification and cloning procedures

PCR (Polymerase Chain Reaction) was carried out in
accordance with a protocol described by Saiki et al.,
1988; PCR was carried out in a Hybaid Intelligent
Heating-block (Teddington, UK): 100 µl of the reaction
mixture contained 50 mM KCl, 10 mM Tris-HCl, pH 8.3, 1.5
mM MgCl2, 100 µg/ml gelatine, and 300 µM of each of the
deoxynucleotides (dATP, dCTP, dGTP, dTTP) (Pharmacia).
In order to amplify sequences which code for the
light-chain binding parts of protein L, there were added
20 pmol of each of the oligonucleotides PL-N and PL-C2,
and 10 µl of a DNA-solution which contained 0.1 mg/ml of
chromosomal DNA from Peptostreptococcus magnus, strain
312. By way of an alternative, 20 pmol of each of the
oligonucleotides CDC-N and CDC-C were added, together
with 10 µl of a DNA-solution which contained 0.1 mg/ml of
chromosomal DNA from a group C streptococcus strain
(Streptococcus equisimilis) called C40 (U. Sjöbring,
L. Björck and W. Kastern. 1991. Streptococcal protein G:
Gene structure and protein binding properties. J. Biol.
Chem. 266: 399-405) ... or with NcoI and SalI (10 U/µg
PCR-product) (for CDC) at 37°C. The thus amplified and
subsequently cleaved DNA-fragments were then separated by
electrophoresis in a 2% (weight by volume) agarose gel
(NuSieve agarose, FMC Bioproducts) in a TAE-buffer (40 mM
Tris, 20 mM Na-acetate, 2 mM EDTA, pH 8.0). The resultant
fragments, 930 bp (for B1-4) and 390 bp (for CDC), were
cut from the gel. The concentration of DNA in the thus
separated gel pieces was estimated to be 0.05 mg/ml. The
agarose pieces cut from the gel and containing the
cleaved, amplified fragments (B1-4 and CDC) were melted
in a water bath at 65°C, whereafter they were allowed to
cool to 37°C. 10 µl (0.5 µg) of this DNA were transferred
to a semi-microtube (Sarstedt), preheated to 37°C,
whereafter 1 µl of the vector pHD389 which had been
cleaved with NarI and SalI was added. 10 x ligase buffer
(Promega) and 1 µl T4 DNA-ligase (1 unit/µl) were also
added. The ligating reaction was permitted to take place
at 37°C for 6 hours. The cleaving and ligating conditions
recommended by the producer of DNA-ligase and restriction
enzymes (Promega) were followed in other respects. The
ligating reaction was then used to transform E. coli,
strain LE392, which had been made competent in accordance
with the rubidium-chloride/calcium-dichloride method as
described by Kushner (1978). Manipulation of DNA was
effected in accordance with molecular biological standard
methods (Sambrook et al., 1989).
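
The quantities quoted in the procedure above can be cross-checked
with simple arithmetic. The Python sketch below is illustrative
only: the volumes, concentrations and fragment lengths come from the
text, whereas the average mass of 650 g/mol per base pair of
double-stranded DNA is a common approximation introduced here as an
assumption, and the function names are hypothetical.

```python
# Approximate molar mass of one base pair of dsDNA (assumption, not
# taken from the text).
AVG_BP_MASS_G_PER_MOL = 650.0


def mass_ug(volume_ul: float, conc_mg_per_ml: float) -> float:
    """Mass in micrograms; 1 mg/ml is the same as 1 µg/µl."""
    return volume_ul * conc_mg_per_ml


def pmol_dsdna(mass_in_ug: float, length_bp: int) -> float:
    """Picomoles of a double-stranded fragment of the given length."""
    moles = mass_in_ug * 1e-6 / (length_bp * AVG_BP_MASS_G_PER_MOL)
    return moles * 1e12


def micromolar(amount_pmol: float, volume_ul: float) -> float:
    """Concentration in µM; 1 pmol/µl is the same as 1 µM."""
    return amount_pmol / volume_ul


if __name__ == "__main__":
    print("primer concentration (µM):", micromolar(20, 100))
    print("template DNA per reaction (µg):", mass_ug(10, 0.1))
    print("gel-purified DNA per ligation (µg):", mass_ug(10, 0.05))
    print("B1-4 fragment (pmol):", round(pmol_dsdna(0.5, 930), 2))
    print("CDC fragment (pmol):", round(pmol_dsdna(0.5, 390), 2))
```

The 10 µl aliquot at 0.05 mg/ml corresponds to the 0.5 µg stated in
the text. The molar amounts are only as accurate as the assumed
average base-pair mass.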

Expression system 

The vector pHD389 (see Figure 2) is a modified variant
of the plasmid pHD313 (Dalboge et al., 1989). The vector,
which is replicated in E. coli (it contains the origin of
replication from plasmid pUC19), is constructed such that
DNA-fragments which have been cloned in the NarI cleaving
site will be expressed immediately after, or downstream
of, the signal peptide (21 amino acids) from the envelope
protein ompA of E. coli. Translation will be initiated
from the ATG codon which codes for the first amino acid
(methionine) in the signal peptide. The construction,
with E. coli's own signal sequence preceding the desired
peptide, enables the translated peptide to be transported
to the periplasmic space in E. coli. This is beneficial
since it reduces the risk of degradation of the desired
product by the intracellularly occurring enzymes of
E. coli. Furthermore, it is easier to purify peptides
which have been exported to the periplasmic space. Unique
recognition sequences (multiple cloning sequences) for
several other restriction enzymes, among them EcoRI, SalI
and BamHI, are present immediately downstream of the NarI
cleaving site. An optimized so-called Shine-Dalgarno
sequence (also called ribosomal binding site, RBS) is
found seven nucleotides upstream of the ATG codon in the
signal sequence from ompA, this optimized Shine-Dalgarno
sequence binding to a complementary sequence in 16S rRNA
in the ribosomes in such a manner that the
translation is initiated in the correct place. The
transcription of DNA which is co-transcribed with the
signal sequence for ompA is controlled by the P_R
promoter from coliphage λ. The vector also contains the
gene for cI857 from coliphage λ, the product of which
down-regulates transcription from P_R and is expressed
constitutively. This cI857-mediated down-regulation of
transcription from P_R is heat-sensitive. Transcription
which is regulated, or controlled, from this promoter
will be terminated with the aid of a so-called
rho-independent transcription terminating sequence which
is inserted in the vector immediately downstream of the
multiple cloning site. The plasmid also carries the gene
for β-lactamase (from the plasmid pUC19), the product of
which permits ampicillin-selection of E. coli clones that
have been transformed with the vector.

Selection of protein LG-producing clones

The transformed bacteria were cultivated on culture
plates with LB-medium which also contained ampicillin in
a concentration of 100 µg/ml. The bacteria were
cultivated overnight at 30°C, whereafter they were
transferred to a cultivation cabinet (42°C) and cultured
for a further four (4) hours. The plates were stored in a
refrigerator overnight. On the following day, the
colonies were transferred to nitrocellulose filters. The
filters and culture plates were marked, so that the
transferred colonies could later be identified on the
culture plate. The culture plates were again incubated
overnight at 30°C, so that the remains of transferred
bacteria colonies on the plates could grow again. The
plates were then stored in a refrigerator. The filter
was incubated in 10% SDS for 10 minutes, so as to lyse
the bacteria in the colonies on the nitrocellulose
impression. Filters containing lysed bacteria were then
rinsed with a blocking buffer consisting of PBS (pH 7.2)
with 0.25% gelatine and 0.25% Tween-20 (four baths of
250 ml at 37°C), whereafter the filter was incubated
with radioactively marked Ig κ-chains (marked with ¹²⁵I
according to the chloramine-T method; 20 ng/ml in PBS
with 0.1% gelatine). The incubation process took place
at room temperature for four (4) hours, whereafter
non-bound radioactively marked protein was rinsed off
with PBS (pH 7.2) containing 0.5 M NaCl, 0.25% gelatine
and 0.25% Tween-20 (four baths, 250 ml each at room
temperature). All filters were exposed to X-ray film.
Positive colonies on the original culture plate were
identified. A number of positive colonies were
re-cultivated on new plates and new colony-blot
experiments were carried out with these plates as a
starting material with the intention of identifying
E. coli colonies which bind IgG Fc. These tests were
carried out in precisely the same manner as that
described above with respect to the identification of
E. coli colonies which expressed Ig light-chain-binding
protein, with the exception that a radioactively marked
(¹²⁵I) IgG Fc (20 ng/ml) was used as a probe. Clones
which reacted with both proteins were selected and
analyzed with regard to the size of the DNA-fragment
introduced in the vector. One of these clones, pHDLG, was
chosen for production of protein LG. The DNA introduced
into plasmid pHD389 in this clone was sequenced. The
DNA-sequence exhibited full agreement with corresponding
sequences (B1-B4 and 21 bases in B5) in the gene for
protein L from Peptostreptococcus magnus, strain 312,
and with the C1DC2 sequence in group C streptococcus
strain C40. The size and binding properties of the
protein produced from clone pHDLG were analyzed with the
aid of SDS-PAGE (see Figure 8), dot-blot experiments (see
Figure 10) and competitive binding experiments.
Production of protein LG

Several colonies from a culture plate with E. coli pHDLG
were used to inoculate a preculture (LB-medium with an
addition of 100 mg/l ampicillin), which was cultivated
at 28°C overnight. In the morning, the preculture was
transferred to a larger volume (100 times the volume of
the preculture) of fresh LB-medium containing ampicillin
(100 mg/l) and was cultivated in shake-flasks (200 rpm)
or fermentors at 28°C. When an absorbance value of 0.5
was reached at 620 nm, the cultivation temperature was
raised to 40°C (induction of transcription). The
cultivation process was then continued for 4 hours (this
applies only to cultivation in shake-flasks). The
bacteria were centrifuged down upon termination of the
cultivation process. The bacteria were then lysed at 4°C
in accordance with an osmotic shock method (Dalboge et
al., 1989). The lysate was adjusted to a pH of 7.
Remaining bacterial debris was centrifuged down and the
supernatant then purified on IgG-sepharose, in accordance
with the protocol earlier described with reference to
protein G and protein L (Sjöbring et al., 1991; Kastern
et al., 1992).

The expression system gave about 30 mg/l of protein LG
when cultivated in shake-flasks. A deposition has been
made at DSM, Identification Reference DSM E. coli
LE392/pHDLG.

Example 3

Analysis of the binding properties of protein LG

Western Blot

Protein G (the C1DC2-fragment), protein L (four
B-domains) and protein LG were separated with SDS-PAGE
(10% acrylamide concentration). The separated proteins
were transferred to nitrocellulose membranes in three
similar copies (triplicate). Each of these membranes was
incubated with radioactively marked proteins (20 ng/ml):
one of the membrane copies was incubated with human
polyclonal IgG, another with human IgG Fc-fragment and
the third with isolated human IgG κ-chains. Non-bound
radioactively marked proteins were rinsed off and all
filters were then exposed to X-ray film.
Slot-blot

Human polyclonal Ig-preparations and Ig-fragments were
applied with the aid of a slot-blot appliance on
nitrocellulose filters in given quantities (see Figure
10) in three similar copies. Each of these membranes was
incubated with radioactively marked proteins (20 ng/ml).
One of the membrane copies was incubated with protein LG,
another with protein L and the third with protein G.
Non-bound radioactively marked proteins were rinsed off
and all filters were then exposed to X-ray film.

The results are shown in Figures 9 and 10.

Other binding experiments have been carried out, with 
the following results: 
Binding of the proteins G, L and LG to immunoglobulins.

Binding protein:   G (K_a)   L (K_a)   LG (K_a)



Immunoglobulin 



Polyclonal IgG 
IgG subclasses 
IgG 

IgG 4 
IgG fragment 
Fc 

F(ab') 2 * 
kappa 
lambda 
Other Ig-classes 
IgM
IgA 
IgE 
IgD 



67 (10) 

2.0 
3.1 
6.1 
4.7 



+ 
+ 



+ 6.0 (0.5) 

0.4 (0.2) + 



9.0 



1.5 



11.6 
10.4 



20 



+ 



+ 



Other Species: 
Polyclonal 
Monkey + 
Rabbit IgG 
IgG-Fc 

IgG-F(ab') 2 
Mouse 
Rat 
Goat 



70 
3.0 
0.44 

41 
1.5 

14 



0.074 



2.6 
0.39 



+ 
+ 
+ 
+ 
TABLE (cont'd.)

Binding of the proteins G, L and LG to immunoglobulins.

Binding protein:   G (K_a)   L (K_a)   LG (K_a)

Immunoglobulin



Bovine 
Horse 

Guinea Pig 

Sheep 

Dog 

Pig 

Hamster 

Cat 

Hen 

Monoclonals

Mouse 

IgG n 



IgG 1 



+ 

+ 

+ 



+ 
+ 
+ 

+ 



IgG2a
IgG2b
IgG
IgM
IgA
Rat
IgG2a
IgG2b
IgG2c



+ 
+ 



+ 
+ 



+ 



+ 
+ 



+ 
+ 
+ 



K_a = affinity constant (M⁻¹). The numerals within parentheses
disclose the affinity of a recombinant protein G comprised of two
IgG-binding domains. *A weak bond to lambda chains exists.
Binding to PL and PLG depends on the type of light chain of Ig.

It will thus be seen that the synthesized hybrid protein LG has a 
broad binding activity/specificity. 
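
For readers who prefer dissociation constants, the affinity
constants quoted in Example 2 can be converted with K_d = 1/K_a.
The Python sketch below is illustrative only: the two K_a values are
those stated in the text for naturally occurring protein L and for
the two-domain protein G fragment, and the function and variable
names are hypothetical.

```python
def dissociation_constant_nM(k_a_per_molar: float) -> float:
    """Convert an affinity constant K_a (M^-1) to K_d in nanomolar."""
    return 1e9 / k_a_per_molar


# K_a values as quoted in Example 2 of the text (M^-1).
AFFINITY_CONSTANTS = {
    "protein L (naturally occurring, B1-B5)": 1e10,
    "protein G fragment (two C-domains with D-region)": 1e9,
}

if __name__ == "__main__":
    for name, k_a in AFFINITY_CONSTANTS.items():
        print(f"{name}: K_d = {dissociation_constant_nM(k_a):g} nM")
```

A higher K_a corresponds to a lower K_d, so the tenfold difference
between the two quoted values appears as 0.1 nM against 1 nM.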
SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: HighTech Receptro AB 

(B) STREET: c/o Active, Skeppsbron 2 

(C) CITY: MALMO 

(E) COUNTRY: SWEDEN 

(F) POSTAL CODE (ZIP) : 211 20 

(G) TELEPHONE: 040/35 07 00 

(H) TELEFAX: 040/ 23 74 05 

(I) TELEX: 32637 Active S 

(ii) TITLE OF INVENTION: Hybridprotein 
(iii) NUMBER OF SEQUENCES: 1 

(iv) COMPUTER READABLE FORM: * 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(v) CURRENT APPLICATION DATA:

APPLICATION NUMBER: SE PCT/SE93/00375
(vi) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: SE 9201331-7

(B) FILING DATE: 28-APR-1992

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser
1               5                   10                  15

Glu Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser
            20                  25                  30

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu
        35                  40                  45

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr
    50                  55                  60

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
65                  70                  75                  80

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
                85                  90                  95

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
            100                 105                 110

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu
        115                 120                 125

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
    130                 135                 140

Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro
145                 150                 155                 160

Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Tyr Ala Asp Gly Lys
                165                 170                 175

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu
            180                 185                 190

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr
        195                 200                 205

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
    210                 215                 220

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
225                 230                 235                 240

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
                245                 250                 255

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
            260                 265                 270

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr
        275                 280                 285

Thr Ile Asn Ile Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu
    290                 295                 300

Glu
305

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDL, DSM 7054 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GCGGTAGAAA ATAAAGAAGA AACACCAGAA ACACCAGAAA CTGATTCAGA 50

AGAAGAAGTA ACAATCAAAG CTAACCTAAT CTTTGCAAAT GGAAGCACAC 100

AAACTGCAGA ATTCAAAGGA ACATTTGAAA AAGCAACATC AGAAGCTTAT 150

GCGTATGCAG ATACTTTGAA GAAAGACAAT GGAGAATATA CTGTAGATGT 200

TGCAGATAAA GGTTATACTT TAAATATTAA ATTTGCTGGA AAAGAAAAAA 250

CACCAGAAGA ACCAAAAGAA GAAGTTACTA TTAAAGCAAA CTTAATCTAT 300

GCAGATGGAA AAACACAAAC AGCAGAATTC AAAGGAACAT TTGAAGAAGC 350

AACAGCAGAA GCATACAGAT ATGCAGATGC ATTAAAGAAG GACAATGGAG 400

AATATACAGT AGACGTTGCA GATAAAGGTT ATACTTTAAA TATTAAATTT 450

GCTGGAAAAG AAAAAACACC AGAAGAACCA AAAGAAGAAG TTACTATTAA 500

AGCAAACTTA ATCTATGCAG ATGGAAAAAC ACAAACAGCA GAATTCAAAG 550

GAACATTTGA AGAAGCAACA GCAGAAGCAT ACAGATATGC TGACTTATTA 600

GCAAAAGAAA ATGGTAAATA TACAGTAGAC GTTGCAGATA AAGGTTATAC 650

TTTAAATATT AAATTTGCTG GAAAAGAAAA AACACCAGAA GAACCAAAAG 700

AAGAAGTTAC TATTAAAGCA AACTTAATCT ATGCAGATGG AAAAACTCAA 750

ACAGCAGAGT TCAAAGGAAC ATTTGCAGAA GCAACAGCAG AAGCATACAG 800

ATACGCTGAC TTATTAGCAA AAGAAAATGG TAAATATACA GCAGACTTAG 850

AAGATGGTGG ATACACTATT AATATTAGAT TTGCAGGTAA GAAAGTTGAC 900

GAAAAACCAG AAGAATAATA A 921



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Escherichia coli LE392/pHDLG, DSM 7055

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser
1               5                   10                  15

Glu Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser
            20                  25                  30

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu
        35                  40                  45

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr
    50                  55                  60

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
65                  70                  75                  80

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
                85                  90                  95

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
            100                 105                 110

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu
        115                 120                 125

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
    130                 135                 140

Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro
145                 150                 155                 160

Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Tyr Ala Asp Gly Lys
                165                 170                 175

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu
            180                 185                 190

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr
        195                 200                 205

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
    210                 215                 220

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
225                 230                 235                 240

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
                245                 250                 255

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
            260                 265                 270

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr
        275                 280                 285

Thr Ile Asn Ile Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu
    290                 295                 300

Glu Pro Met Asp Thr Tyr Lys Leu Ile Leu Asn Gly Lys Thr Leu Lys
305                 310                 315                 320

Gly Glu Thr Thr Thr Glu Ala Val Asp Ala Ala Thr Ala Glu Lys Val
                325                 330                 335

Phe Lys Gln Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu Trp Thr Tyr
            340                 345                 350

Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu Lys Pro Glu Val Ile
        355                 360                 365

Asp Ala Ser Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys Leu Val Ile
    370                 375                 380

Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp Ala
385                 390                 395                 400

Glu Thr Ala Glu Lys Ala Phe Lys Gln Tyr Ala Asn Asp Asn Gly Val
                405                 410                 415

Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr
            420                 425                 430

Glu Met



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli LE392/pHDLG, DSM 7055

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GCGGTAGAAA ATAAAGAAGA AACACCAGAA ACACCAGAAA CTGATTCAGA 50 

AGAAGAAGTA ACAATCAAAG CTAACCTAAT CTTTGCAAAT GGAAGCACAC 100 

AAACTGCAGA ATTCAAAGGA ACATTTGAAA AAGCAACATC AGAAGCTTAT 150 

GCGTATGCAG ATACTTTGAA GAAAGACAAT GGAGAATATA CTGTAGATGT 200 
TGCAGATAAA GGTTATACTT TAAATATTAA ATTTGCTGGA AAAGAAAAAA 250

CACCAGAAGA ACCAAAAGAA GAAGTTACTA TTAAAGCAAA CTTAATCTAT 300

GCAGATGGAA AAACACAAAC AGCAGAATTC AAAGGAACAT TTGAAGAAGC 350

AACAGCAGAA GCATACAGAT ATGCAGATGC ATTAAAGAAG GACAATGGAG 400

AATATACAGT AGACGTTGCA GATAAAGGTT ATACTTTAAA TATTAAATTT 450

GCTGGAAAAG AAAAAACACC AGAAGAACCA AAAGAAGAAG TTACTATTAA 500

AGCAAACTTA ATCTATGCAG ATGGAAAAAC ACAAACAGCA GAATTCAAAG 550

GAACATTTGA AGAAGCAACA GCAGAAGCAT ACAGATATGC TGACTTATTA 600

GCAAAAGAAA ATGGTAAATA TACAGTAGAC GTTGCAGATA AAGGTTATAC 650

TTTAAATATT AAATTTGCTG GAAAAGAAAA AACACCAGAA GAACCAAAAG 700

AAGAAGTTAC TATTAAAGCA AACTTAATCT ATGCAGATGG AAAAACTCAA 750

ACAGCAGAGT TCAAAGGAAC ATTTGCAGAA GCAACAGCAG AAGCATACAG 800

ATACGCTGAC TTATTAGCAA AAGAAAATGG TAAATATACA GCAGACTTAG 850

AAGATGGTGG ATACACTATT AATATTAGAT TTGCAGGTAA GAAAGTTGAC 900

GAAAAACCAG AAGAACCCAT GGACACTTAC AAATTAATCC TTAATGGTAA 950

AACATTGAAA GGCGAAACAA CTACTGAAGC TGTTGATGCT GCTACTGCAG 1000

AAAAAGTCTT CAAACAATAC GCTAACGACA ACGGTGTTGA CGGTGAATGG 1050

ACTTACGACG ATGCGACTAA GACCTTTACA GTTACTGAAA AACCAGAAGT 1100

GATCGATGCG TCTGAATTAA CACCAGCCGT GACAACTTAC AAACTTGTTA 1150

TTAATGGTAA AACATTGAAA GGCGAAACAA CTACTAAAGC AGTAGACGCA 1200

GAAACTGCAG AAAAAGCCTT CAAACAATAC GCTAACGACA ACGGTGTTGA 1250

TGGTGTTTGG ACTTATGATG ATGCGACTAA GACCTTTACG GTAACTGAAA 1300

TGTAATAA 1308
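
The lengths stated in the sequence listing can be cross-checked
against each other. The Python sketch below is illustrative only: it
assumes that each nucleotide sequence consists of the coding triplets
followed by the two stop codons (TAA TAA) visible at the end of SEQ
ID NO: 2 and SEQ ID NO: 4, and the function name is hypothetical.

```python
def expected_coding_length(n_residues: int, n_stop_codons: int = 2) -> int:
    """Nucleotides needed to encode n_residues plus trailing stop codons."""
    return 3 * (n_residues + n_stop_codons)


# (amino acid length, stated nucleotide length) from the listing.
LISTED = {
    "protein L (SEQ ID NO: 1 / 2)": (305, 921),
    "protein LG (SEQ ID NO: 3 / 4)": (434, 1308),
}

if __name__ == "__main__":
    for name, (residues, base_pairs) in LISTED.items():
        ok = expected_coding_length(residues) == base_pairs
        print(name, "consistent" if ok else "inconsistent")
```

Under that assumption both pairs agree: 305 residues with two stop
codons give 921 bases, and 434 residues give 1308 bases.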




1. Protein L having the ability to bind to the light
chains of immunoglobulins, characterized
in that the protein L has the following amino acid
sequence:

B1

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser
1               5                   10                  15

Glu Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser
            20                  25                  30

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu
        35                  40                  45

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr
    50                  55                  60

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
65                  70                  75                  80

B2

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
                85                  90                  95

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
            100                 105                 110

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu
        115                 120                 125

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
    130                 135                 140

B3

Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro
145                 150                 155                 160

Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Tyr Ala Asp Gly Lys
                165                 170                 175

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu
            180                 185                 190

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr
        195                 200                 205

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
    210                 215                 220

B4

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
225                 230                 235                 240

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
                245                 250                 255

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
            260                 265                 270

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr
        275                 280                 285

Thr Ile Asn Ile Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu
    290                 295                 300

Glu
305



an, v ariants , subfragneRts> I 
*»~». Bl-35 having the saDl e binding pnp ^ 0 ^ °~ 



2. DNA-sequence, characterized in that
it codes for the protein according to Claim 1 and has the
following nucleotide sequence:

GC? GTA GAA AAT AAA GAA GAA ACA CCA GAA ACA CCA GAA ACT GAT TCA 48
GAA GAA GA? GT? ACA ATC AAA GCT AA? CTA ATC TTT GCA AAT GGA AGC 96
ACA CAA ACT GCA GAA TTC AAA GGA ACA TTT GA? AAA GCA ACA TCA GAA 144
GCT TAT GCG TAT GCA GAT ACT TTG AAG AAA GAC AAT GGA GAA TAT ACT 192
GTA GAT GTT GCA GAT AAA GGT TAT ACT TTA AAT ATT AAA TTT GCT GGA 240
AAA GAA AAA ACA CCA GAA GAA CCA AAA GAA GAA GTT ACT ATT AAA GCA 288
AAC TTA ATC TAT GCA GAT GGA AAA ACA CAA ACA GCA GAA TTC AAA GGA 336
ACA TTT GAA GAA GCA ACA GCA GAA GCA TAC AGA TAT GCA GAT GCA TTA 384
AAG AAG GAC AAT GGA GAA TAT ACA GTA GAC GTT GCA GAT AAA GGT TAT 432
ACT TTA AAT ATT AAA TTT GCT GGA AAA GAA AAA ACA CCA GAA GAA CCA 480
AAA GAA GAA GTT ACT ATT AAA GCA AAC TTA ATC TAT GCA GAT GGA AAA 528
ACA CAA ACA GCA GAA TTC AAA GGA ACA TTT GAA GAA GCA ACA GCA GAA 576
GCA TAC AGA TAT GCT GAC TTA TTA GCA AAA GAA AAT GGT AAA TAT ACA 624
GTA GAC GTT GCA GAT AAA GGT TAT ACT TTA AAT ATT AAA TTT GCT GGA 672
AAA GAA AAA ACA CCA GAA GAA CCA AAA GAA GAA GTT ACT ATT AAA GCA 720
AAC TTA ATC TAT GCA GAT GGA AAA ACT CAA ACA GCA GAG TTC AAA GGA 768
ACA TTT GCA GAA GCA ACA GCA GAA GCA TAC AGA TAC GCT GAC TTA TTA 816
GCA AAA GAA AAT GGT AAA TAT ACA GCA GAC TTA GAA GAT GGT GGA TAC 864
ACT ATT AAT ATT AGA TTT GCA GGT AAG AAA GTT GAC GAA AAA CCA GAA 912
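
The figures at the right-hand margin of the nucleotide listing appear to be running nucleotide counts taken at the end of each row of sixteen codons. The Python sketch below reproduces that arithmetic as a reading aid; `row_end_counts` is a hypothetical helper introduced here and is not part of the specification.

```python
def row_end_counts(n_codons, codons_per_row=16):
    """Running nucleotide count at the end of each printed row."""
    counts = []
    # Step through the codon index at which each printed row would end.
    for end in range(codons_per_row, n_codons + codons_per_row, codons_per_row):
        # The last row may be shorter than a full row.
        counts.append(3 * min(end, n_codons))
    return counts


# 304 codons, as in the protein L coding sequence above.
print(row_end_counts(304)[-3:])
```

Under the same assumption, a sequence of 434 codons would end its last full row at 1296 and its final short row at 1302.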



3. A hybrid protein, characterized in
that it includes one or more of the B1-B5 domains ac-
cording to Claim 1 which bind to the light chains in
immunoglobulins of all classes, and domains which bind
to heavy chains in immunoglobulin G.

4. A hybrid protein according to Claim 3, char-
acterized in that the domains which bind to
heavy chains in immunoglobulin G are chosen from among
the C1- and C2-domains in protein G or from among any
other functionally similar proteins which bind to heavy
chains in immunoglobulin G, and variants, subfragments,
multiples or mixtures thereof having the same binding
properties.

5. A hybrid protein according to Claim 4, char-
acterized in that the hybrid protein has the
following amino acid sequence:

Ala Val Glu Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser
1               5                   10                  15

Glu Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser
            20                  25                  30

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu
        35                  40                  45

Ala Tyr Ala Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr
    50                  55                  60

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
65                  70                  75                  80

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
                85                  90                  95

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
            100                 105                 110

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu
        115                 120                 125

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
    130                 135                 140

Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro
145                 150                 155                 160

Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile Tyr Ala Asp Gly Lys
                165                 170                 175

Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu
            180                 185                 190

Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr
        195                 200                 205

Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
    210                 215                 220

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
225                 230                 235                 240

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
                245                 250                 255

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
            260                 265                 270

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr
        275                 280                 285

Thr Ile Asn Ile Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu
    290                 295                 300

Glu Pro Met Asp Thr Tyr Lys Leu Ile Leu Asn Gly Lys Thr Leu Lys
305                 310                 315                 320

Gly Glu Thr Thr Thr Glu Ala Val Asp Ala Ala Thr Ala Glu Lys Val
                325                 330                 335

Phe Lys Gln Tyr Ala Asn Asp Asn Gly Val Asp Gly Glu Trp Thr Tyr
            340                 345                 350

Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu Lys Pro Glu Val Ile
        355                 360                 365

Asp Ala Ser Glu Leu Thr Pro Ala Val Thr Thr Tyr Lys Leu Val Ile
    370                 375                 380

Asn Gly Lys Thr Leu Lys Gly Glu Thr Thr Thr Lys Ala Val Asp Ala
385                 390                 395                 400

Glu Thr Ala Glu Lys Ala Phe Lys Gln Tyr Ala Asn Asp Asn Gly Val
                405                 410                 415

Asp Gly Val Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr
            420                 425                 430

Glu Met

and variants, subfragments, multiples or mixtures of the
domains B1-B5 having the same binding properties.

6. DNA-sequence, characterized in that
it codes for a protein according to Claim 5 and has the
following nucleotide sequence:

GC? GTA GAA AAT AAA GAA GAA ACA CCA GAA ACA CCA GAA ACT GAT TCA 48
GAA GAA GA? GT? ACA ATC AAA GCT AA? CTA ATC TTT GCA AAT GGA AGC 96
ACA CAA ACT GCA GAA TTC AAA GGA ACA TTT GA? AAA GCA ACA TCA GAA 144
GCT TAT GCG TAT GCA GAT ACT TTG AAG AAA GAC AAT GGA GAA TAT ACT 192
GTA GAT GTT GCA GAT AAA GGT TAT ACT TTA AAT ATT AAA TTT GCT GGA 240
AAA GAA AAA ACA CCA GAA GAA CCA AAA GAA GAA GTT ACT ATT AAA GCA 288
AAC TTA ATC TAT GCA GAT GGA AAA ACA CAA ACA GCA GAA TTC AAA GGA 336
ACA TTT GAA GAA GCA ACA GCA GAA GCA TAC AGA TAT GCA GAT GCA TTA 384
AAG AAG GAC AAT GGA GAA TAT ACA GTA GAC GTT GCA GAT AAA GGT TAT 432
ACT TTA AAT ATT AAA TTT GCT GGA AAA GAA AAA ACA CCA GAA GAA CCA 480
AAA GAA GAA GTT ACT ATT AAA GCA AAC TTA ATC TAT GCA GAT GGA AAA 528
ACA CAA ACA GCA GAA TTC AAA GGA ACA TTT GAA GAA GCA ACA GCA GAA 576
GCA TAC AGA TAT GCT GAC TTA TTA GCA AAA GAA AAT GGT AAA TAT ACA 624
GTA GAC GTT GCA GAT AAA GGT TAT ACT TTA AAT ATT AAA TTT GCT GGA 672
AAA GAA AAA ACA CCA GAA GAA CCA AAA GAA GAA GTT ACT ATT AAA GCA 720
AAC TTA ATC TAT GCA GAT GGA AAA ACT CAA ACA GCA GAG TTC AAA GGA 768
ACA TTT GCA GAA GCA ACA GCA GAA GCA TAC AGA TAC GCT GAC TTA TTA 816
GCA AAA GAA AAT GGT AAA TAT ACA GCA GAC TTA GAA GAT GGT GGA TAC 864
ACT ATT AAT ATT AGA TTT GCA GGT AAG AAA GTT GAC GAA AAA CCA GAA 912
GAA CCC ATG GAC ACT TAC AAA TTA ATC CTT AAT GGT AAA ACA TTG AAA 960
GGC GAA ACA ACT ACT GAA GCT GTT GAT GCT GCT ACT GCA GAA AAA GTC 1008
TTC AAA CAA TAC GCT AAC GAC AAC GGT GTT GAC GGT GAA TGG ACT TAC 1056
GAC GAT GCG ACT AAG ACC TTT ACA GTT ACT GAA AAA CCA GAA GTG ATC 1104
GAT GCG TCT GAA TTA ACA CCA GCC GTG ACA ACT TAC AAA CTT GTT ATT 1152
AAT GGT AAA ACA TTG AAA GGC GAA ACA ACT ACT AAA GCA GTA GAC GCA 1200
GAA ACT GCA GAA AAA GCC TTC AAA CAA TAC GCT AAC GAC AAC GGT GTT 1248
GAT GGT GTT TGG ACT TAT GAT GAT GCG ACT AAG ACC TTT ACG GTA ACT 1296
GAA ATG TAATAA 1302

7. DNA-sequence, characterized in that
it codes for a protein according to Claims 3, 4 and 5.

8. A plasmid vector, characterized in
that it includes a DNA-sequence according to any one of
Claims 2 and 6-7, preferably the vector pHDLG or pHDL
according to Fig. 3 or 4.

9. A host cell, characterized in that it
is transformed with the hybrid plasmid according to
Claim 9, in particular a host which belongs to the
species E. coli, particularly E. coli LE392, or Bacillus
subtilis, Saccharomyces cerevisiae, preferably Id. Ref.
DSM E. coli LE392/pHDL and E. coli LE392/pHDLG
respectively.

10. A method for producing a protein according to
Claims 1 and 3-5, characterized by cultivating
a host cell according to Claim 10 under suitable
conditions; accumulating the protein in the culture or
lysing the cells and extracting the protein therefrom.

11. A reagent kit for binding, separating and identi-
fying immunoglobulins, characterized in
that it includes a protein according to any one of
Claims 1 and 3-5.

12. A composition, characterized in that
it includes a protein according to any one of Claims 2
and 3-5, and optionally additives or carriers.

13. A pharmaceutical composition, charac-
terized in that it includes a protein according
to any one of Claims 1 and 3-5, and optionally a
pharmaceutically acceptable carrier or extender.



The invention relates to sequences of protein L which bind 
to light chains of immunoglobulins. The invention also relates 
to hybrid proteins thereof which are able to bind to both light 
and heavy chains of immunoglobulin G, in particular protein LG. 
The invention also relates to DNA-sequences which code for the 
proteins, vectors which include such DNA-sequences, host cells 
which have been transformed with the vectors, methods for 
producing the proteins, reagent appliances for separation and
identification of immunoglobulins, compositions and
pharmaceutical compositions which contain the proteins.
Applicant or Patentee: Lars BJÖRCK et al. Attorney's DWB
For: PROTEIN L AND HYBRID PROTEINS THEREOF



VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY
STATUS (37 CFR 1.9 (f) and 1.27 (b)) - INDEPENDENT INVENTOR



As a below named inventor, I hereby declare that I qualify as an independent inventor as defined in 37 CFR 1.9 (c) for
purposes of paying reduced fees under section 41 (a) and (b) of Title 35, United States Code, to the Patent and Trademark
Office with regard to the invention entitled PROTEIN L AND HYBRID PROTEINS THEREOF
described in

[X] the specification filed herewith

[ ] application serial no. __ , filed October 26, 1994

[ ] patent no. __ , issued __



I have not assigned, granted, conveyed or licensed and am under no obligation under contract or law to assign, grant, convey
or license, any rights in the invention to any person who could not be classified as an independent inventor under 37 CFR
1.9 (c) if that person had made the invention, or to any concern which would not qualify as a small business concern under
37 CFR 1.9 (d) or a nonprofit organization under 37 CFR 1.9 (e).

Each person, concern or organization to which I have assigned, granted, conveyed, or licensed or am under an obligation
under contract or law to assign, grant, convey, or license any rights in the invention is listed below:

[ ] no such person, concern, or organization

[X] persons, concerns or organizations listed below*

*NOTE: Separate verified statements are required from each named person, concern or organization
having rights to the invention averring to their status as small entities. (37 CFR 1.27)

FULL NAME HighTech Receptor AB

ADDRESS c/o Active i Malmö AB, Stora Nygatan, S-211 37 Malmö
I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of
entitlement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any maintenance fee
due after the date on which status as a small entity is no longer appropriate. (37 CFR 1.28 (b))

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information
and belief are believed to be true; and further that these statements were made with the knowledge that willful false statements
and the like so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of the United States
Code, and that such willful false statements may jeopardize the validity of the application, any patent issuing thereon, or
any patent to which this verified statement is directed.
VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY
STATUS (37 CFR 1.9 (f) and 1.27 (c)) - SMALL BUSINESS CONCERN



I hereby declare that I am

[ ] the owner of the small business concern identified below:

[ ] an official of the small business concern empowered to act on behalf of the concern identified below:



NAME OF CONCERN HighTech Receptor AB
ADDRESS OF CONCERN c/o Active i Malmö AB, Stora Nygatan 6T,
S-211 37 Malmö, Sweden



I hereby declare that the above identified small business concern qualifies as a small business concern as defined in 13 CFR
121.3-18, and reproduced in 37 CFR 1.9 (d), for purposes of paying reduced fees under section 41 (a) and (b) of Title 35,
United States Code, in that the number of employees of the concern, including those of its affiliates, does not exceed 500
persons. For purposes of this statement, (1) the number of employees of the business concern is the average over the previous
fiscal year of the concern of the persons employed on a full-time, part-time or temporary basis during each of the pay periods
of the fiscal year, and (2) concerns are affiliates of each other when either, directly or indirectly, one concern controls or
has the power to control the other, or a third party or parties controls or has the power to control both.

I hereby declare that rights under contract or law have been conveyed to and remain with the small business concern
identified above with regard to the invention, entitled PROTEIN L AND HYBRID PROTEINS THEREOF
by inventor(s) Lars BJÖRCK et al.
described in

[X] the specification filed herewith
[ ] application serial no. __ , filed October 26, 1994
[ ] patent no. __ , issued __

If the rights held by the above identified small business concern are not exclusive, each individual, concern or organization
having rights to the invention is listed below* and no rights to the invention are held by any person, other than the inventor,
who could not qualify as a small business concern under 37 CFR 1.9 (d) or by any concern which would not qualify as a
small business concern under 37 CFR 1.9 (d) or a nonprofit organization under 37 CFR 1.9 (e).

*NOTE: Separate verified statements are required from each named person, concern or organization
having rights to the invention averring to their status as small entities. (37 CFR 1.27)
I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of
entitlement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any maintenance fee
due after the date on which status as a small entity is no longer appropriate. (37 CFR 1.28 (b))

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information
and belief are believed to be true; and further that these statements were made with the knowledge that willful false statements
and the like so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of the United States
Code, and that such willful false statements may jeopardize the validity of the application, any patent issuing thereon, or
any patent to which this verified statement is directed.
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AAGCT TAAGGAGGTTA ATCG ATG AAA AAA ACT_GCT_ATC_GCT_ATC_GCT_GTT 
RBS
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GCT CTG GCT GGT TTC GCT ACT GTT GCT CAG GCG GCG CCG AGA TCT 
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AAA CAG GAA TTC GAG CTC GGT ACC CGG GGA TCC TCT AGA GTC GAC 
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BINDING- SEQUENCE (EMPHASIZED WITH 
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